GenCore version 6.2.1
Copyright (c) 1993 - 2008 Biocceleration Ltd. 



OM protein - protein search, using sw model 



September 18, 2008, 22:00:18 ; Search time 41 Seconds 
(without alignments) 
1537.121 Million cell updates/sec 



Title:          US-09-961-086A-1
Perfect score:  3352
Sequence:       1 MSSSNVEVFIPVSQGNTNGF..........MIVIFLTIAYLKLLFLKKYS 655



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283416 seqs, 96216763 residues 
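The throughput figure in this header can be cross-checked from the other reported values. The sketch below assumes that one "cell update" is one query-residue by database-residue cell of the dynamic-programming matrix; that definition is an assumption, not something the report states, and the function name is illustrative only.

```python
def cell_updates_per_second(query_length: int, db_residues: int, seconds: float) -> float:
    """Cells of the alignment matrix filled per second of search time."""
    return query_length * db_residues / seconds


# Values taken from the report header: query of 655 residues,
# 96216763 database residues, 41 seconds of search time.
rate = cell_updates_per_second(655, 96_216_763, 41)
print(f"{rate / 1e6:.3f} million cell updates/sec")
```

Under that assumption the result agrees with the 1537.121 million cell updates/sec printed above.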

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 
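The parameters above describe a Smith-Waterman ("sw") search with BLOSUM62, Gapop 10.0 and Gapext 0.5. As a rough illustration of how such a score is computed, here is a minimal sketch of local alignment scoring with affine gaps (the Gotoh recurrences). It is not GenCore's implementation. Two things are assumptions: the gap convention (a gap of length k costs Gapop + k * Gapext, which is consistent with the half-integer score 2849.5 reported for a hit with a single one-residue indel, given integer substitution scores) and the toy scoring function, which stands in for BLOSUM62.

```python
def sw_affine_score(a, b, score, gap_open=10.0, gap_extend=0.5):
    """Best local alignment score of a vs b with affine gap penalties.

    Assumed convention: a gap of length k costs gap_open + k * gap_extend.
    """
    neg = float("-inf")
    n = len(b)
    h_prev = [0.0] * (n + 1)  # best score of an alignment ending at (i-1, j)
    f_prev = [neg] * (n + 1)  # same, but ending in a gap that skips residues of a
    best = 0.0
    for i in range(1, len(a) + 1):
        h_cur = [0.0] * (n + 1)
        f_cur = [neg] * (n + 1)
        e = neg  # best score ending at (i, j) in a gap that skips residues of b
        for j in range(1, n + 1):
            e = max(h_cur[j - 1] - gap_open - gap_extend, e - gap_extend)
            f_cur[j] = max(h_prev[j] - gap_open - gap_extend, f_prev[j] - gap_extend)
            diag = h_prev[j - 1] + score(a[i - 1], b[j - 1])
            h_cur[j] = max(0.0, diag, e, f_cur[j])  # 0 restarts the local alignment
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


def toy_score(x, y):
    """Hypothetical substitution scores (NOT BLOSUM62): +5 match, -4 mismatch."""
    return 5.0 if x == y else -4.0


print(sw_affine_score("ACDEFGHIKLMNPQ", "ACDEFGIKLMNPQ", toy_score))
```

With the toy scores, thirteen matching residues bridged by one single-residue gap score 13 * 5 - 10.5 = 54.5, which beats the best ungapped local alignment (35). Swapping in a real BLOSUM62 lookup for `toy_score` would be needed to approach the scores listed in this report.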



Database: PIR_80:*
    pir1:*
    pir2:*
    pir3:*
    pir4:*



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result            Query
   No.    Score   Match  Length  DB  ID       Description
     1   2849.5    85.0     656   2  JC7860   brain multidrug re
     2    835.5    24.9    1049   1  S19421   ATP-dependent perm
ALIGNMENTS



RESULT 1 
JC7860 

brain multidrug resistance protein, BMDP - pig 
C;Species: Sus scrofa domestica (domestic pig)
C;Date: 18-Nov-2002 #sequence_revision 18-Nov-2002 #text_change 09-Jul-2004
C;Accession: JC7860
R;Eisenblaetter, T.; Galla, H.J.
Biochem. Biophys. Res. Commun. 293, 1273-1278, 2002
A;Title: A new multidrug resistance protein at the blood-brain barrier.
A;Reference number: JC7860; MUID:22050127; PMID:12054514
A;Accession: JC7860
A;Molecule type: mRNA
A;Residues: 1-656 <EIS>
A;Cross-references: UNIPROT:Q8MIB3; UNIPARC:UPI0000087EC5; GB:AJ420927
A;Experimental source: brain
C;Comment: This protein, a new transport protein of the ATP-binding cassette (ABC)
superfamily of transporters, expressed in porcine brain capillary endothelial cells, plays
an important role in the exclusion of xenobiotics from the brain and participates in drug
transport across the blood-brain barrier and therefore is considered an efflux pump at
the cerebral endothelium.
C;Genetics:
A;Gene: bmdp

Query Match 85.0%; Score 2849.5; DB 2; Length 656; 

Best Local Similarity 84.3%; Pred. No. 2.7e-192; 

Matches 553; Conservative 44; Mismatches 58; Indels 1; Gaps 1; 
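The two percentages in this statistics block can be reproduced from the other numbers printed with it. The formulas below are inferred from the figures in this listing (they reproduce the values shown for the results in this report), not taken from GenCore documentation, so treat them as an assumption; the function names are illustrative.

```python
def query_match(score: float, perfect_score: float) -> float:
    """Score as a percentage of the query's perfect (self) score."""
    return 100.0 * score / perfect_score


def best_local_similarity(matches: int, conservative: int, mismatches: int, indels: int) -> float:
    """Identical columns as a percentage of all alignment columns."""
    columns = matches + conservative + mismatches + indels
    return 100.0 * matches / columns


# RESULT 1: Score 2849.5 against Perfect score 3352
print(f"{query_match(2849.5, 3352):.1f}%")
# RESULT 1: Matches 553; Conservative 44; Mismatches 58; Indels 1
print(f"{best_local_similarity(553, 44, 58, 1):.1f}%")
```

The same two functions can be applied to the statistics lines of the other results to check that the extracted numbers are internally consistent.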



Qy 1 MSSSNVEVFIPVSQGNTNGFPATASNDLKAFTEGAVLSFHNICYRVKLKSGFLPCRKPVE 60
III:: : I I I : I : I I I I I : : I I : I I I I I I I I I : I I I I I I : I I I I I III II 

Db 1 MSSNSYQVSIPMSKRNTNGLPGSSSNELKTSAGGAVLSFHDICYRVKVKSGFLFCRKTVE 60



Qy 61 KEILSNINGIMKPGLNAILGPTGGGKSSLLDVLAARKDPSGLSGDVLINGAPRPANFKCN 120

I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 KEILTNINGIMKPGLNAILGPTGGGKSSLLDVLAARKDPHGLSGDVLINGAPRPANFKCN 120



Qy 121 SGYVVQDDVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVADSKVGT 180

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 SGYVVQDDVVMGTLTVRENLQFSAALRLPTTMTNHEKNERINMVIQELGLDKVADSKVGT 180



Qy 181 QFIRGVSGGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQGRTIIF 240

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 QFIRGVSGGERKRTSIAMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQGRTIIF 240



Qy 241 SIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLDIING 300

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I 11:11 1111111111:111 
Db 241 SIHQPRYSIFKLFDSLTLLASGRLMFHGPAREALGYFASIGYNCEPYNNPADFFLDVING 300



Qy 301 DSTAVALNR-EEDFKATEIIEPSKQDKPLIEKLAEIYVNSSFYKETKAELHQLSGGEKKK 359

11:111:1:1 II I I :: I I I I : I I I I I I I I : I : I I I I I I I I I I I 
Db 301 DSSAVVLSRADRDEGAQEPEEPPEKDTPLIDKLAAFYTNSSFFKDTKVELDQFSGGRKKK 360



Qy 360 KITVFKEISYTTSFCHQLRWVSKRSFKNLLGNPQASIAQIIVTVVLGLVIGAIYFGLKND 419

I : I : I I :: I I I I I I I I I I I : I : I I I I I I I I I I I I I : I I I I I I :: I I I I I I I I : : I I I I 
Db 361 KSSVYKEVTYTTSFCHQLRWISRRSFKNLLGNPQASVAQIIVTIILGLVIGAIFYDLKND 420



Qy 420 STGIQNRAGVLFFLTTNQCFSSVSAVELFVVEKKLFIHEYISGYYRVSSYFLGKLLSDLL 479

: I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 PSGIQNRAGVLFFLTTNQCFSSVSAVELLVVEKKLFIHEYISGYYRVSSYFFGKLLSDLL 480



Qy 480 PMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSMALAIAAGQSVVSVATL 539

II I I I I I I I I I I 11:11111 : I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 PMRMLPSIIFTCITYFLLGLKPAVGSFFIMMFTLMMVAYSASSMALAIAAGQSVVSVATL 540



Qy 540 LMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFTALQHNEFLGQNFCPGLNATG 599

I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I : I I I : I I I I I I I I I I I I I I 
Db 541 LMTISFVFMMIFSGLLVNLKTVVPWLSWLQYFSIPRYGFSALQYNEFLGQNFCPGLNVTT 600
Qy 600 NNPCNYATCTGEEYLVKQGIDLSPWGLWKNHVALACMIVIFLTIAYLKLLFLKKYS 655

II I : : I III III III II I I I I : I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 601 NNTCSFAICTGAEYLENQGISLSAWGLWQNHVALACMMVIFLTIAYLKLLLLKKYS 656 
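The marker row between each Qy and Db line can be partly recomputed from the two sequence rows, which is useful when the extracted marker row is unreadable. The sketch below marks identical columns with `|` and counts identical, differing, and gapped columns. It does not attempt the `:` marks for conservative substitutions, because those depend on the scoring table; the function name and the use of `-` as the gap character are assumptions made for illustration.

```python
def compare_aligned(qy: str, db: str, gap: str = "-"):
    """Return an identity marker row and column counts for two aligned rows."""
    if len(qy) != len(db):
        raise ValueError("aligned rows must have the same number of columns")
    marks = []
    counts = {"identical": 0, "different": 0, "gapped": 0}
    for q, d in zip(qy, db):
        if q == gap or d == gap:
            counts["gapped"] += 1
            marks.append(" ")
        elif q == d:
            counts["identical"] += 1
            marks.append("|")
        else:
            counts["different"] += 1  # conservative or mismatch, not distinguished here
            marks.append(" ")
    return "".join(marks), counts


# Last block of RESULT 1 (query residues 600-655, database residues 601-656)
QY = "NNPCNYATCTGEEYLVKQGIDLSPWGLWKNHVALACMIVIFLTIAYLKLLFLKKYS"
DB = "NNTCSFAICTGAEYLENQGISLSAWGLWQNHVALACMMVIFLTIAYLKLLLLKKYS"
marks, counts = compare_aligned(QY, DB)
print(QY)
print(marks)
print(DB)
print(counts)
```

For this block the function reports 44 identical columns, 12 differing columns, and no gaps out of 56.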



RESULT 2 
S19421 

ATP-dependent permease ADP1 precursor - yeast (Saccharomyces cerevisiae)
N;Alternate names: protein YCR011c; protein YCR105
C;Species: Saccharomyces cerevisiae
C;Date: 31-Mar-1992 #sequence_revision 31-Mar-1992 #text_change 09-Jul-2004
C;Accession: S19421; S40914
R;Goffeau, A.; Purnelle, B.; Skala, J.
submitted to the Protein Sequence Database, March 1992
A;Reference number: S19420
A;Accession: S19421
A;Molecule type: DNA
A;Residues: 1-1049 <GOF>
A;Cross-references: UNIPROT:P25371; UNIPARC:UPI00001255FD; EMBL:X59720; NID:g1907116;
PIDN:CAA42328.1; PID:g1907154; GSPDB:GN00003; MIPS:YCR011c
R;Purnelle, B.; Skala, J.; Goffeau, A.
Yeast 7, 867-872, 1991
A;Title: The product of the YCR105 gene located on the chromosome III from Saccharomyces
cerevisiae presents homologies to ATP-dependent permeases.
A;Reference number: S40914; MUID:92160395; PMID:1789009
A;Accession: S40914
A;Status: not compared with conceptual translation
A;Molecule type: DNA
A;Residues: 1-1049 <PUR>
A;Cross-references: UNIPARC:UPI00001255FD
R;Skala, J.; Purnelle, B.; Goffeau, A.
Yeast 8, 409-417, 1992
A;Title: The complete sequence of a 10.8 kb segment distal of SUF2 on the right arm of
chromosome III from Saccharomyces cerevisiae reveals seven open reading frames including
the RVS161, ADP1 and PGK genes.
A;Reference number: S25353; MUID:92327849; PMID:1626432
A;Contents: annotation
C;Genetics:
A;Gene: SGD:ADP1; MIPS:YCR011c
A;Cross-references: SGD:S0000604; MIPS:YCR011c
A;Map position: 3R
C;Superfamily: ATP-dependent permease ADP1; ATP-binding cassette homology
C;Keywords: ATP; glycoprotein; nucleotide binding; P-loop; transmembrane protein
F;1-25/Domain: signal sequence #status predicted <SIG>
F;26-1049/Product: ATP-dependent permease ADP1 #status predicted <MAT>
F;26-324/Domain: extracellular #status predicted <EXT>
F;325-341/Domain: transmembrane #status predicted <TM1>
F;406-607/Domain: ATP-binding cassette homology <ABC>
F;423-430/Region: nucleotide-binding motif A (P-loop)
F;550-557/Region: nucleotide-binding motif B
F;794-810/Domain: transmembrane #status predicted <TM2>
F;829-845/Domain: transmembrane #status predicted <TM3>
F;878-894/Domain: transmembrane #status predicted <TM4>
F;909-925/Domain: transmembrane #status predicted <TM5>
F;938-954/Domain: transmembrane #status predicted <TM6>
F;1025-1041/Domain: transmembrane #status predicted <TM7>
F;50,114,165,221/Binding site: carbohydrate (Asn) (covalent) #status predicted
F;429/Binding site: ATP (Lys) #status predicted
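The feature lines above follow a regular pattern that can be read mechanically: positions, a feature kind, a description, an optional "#status predicted" flag, and an optional label in angle brackets. The sketch below parses that pattern as it appears in this listing. The field names and the `Feature` type are hypothetical, and the layout is read off the records shown here rather than from a PIR format specification. The parser tolerates stray spaces of the kind introduced by text extraction.

```python
import re
from typing import List, NamedTuple, Optional, Tuple


class Feature(NamedTuple):
    positions: List[Tuple[int, int]]  # (start, end); single sites have start == end
    kind: str
    description: str
    predicted: bool
    label: Optional[str]


FEATURE_RE = re.compile(
    r"^F\s*;\s*(?P<pos>[\d\s,\-]+?)\s*/\s*(?P<kind>[^:]+?)\s*:\s*(?P<rest>.*)$"
)


def parse_feature(line: str) -> Feature:
    """Parse one 'F;positions/Kind: description' line from a record."""
    m = FEATURE_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not a feature line: {line!r}")
    positions = []
    for part in m.group("pos").split(","):
        part = re.sub(r"\s+", "", part)  # drop extraction-induced spaces
        start, _, end = part.partition("-")
        positions.append((int(start), int(end or start)))
    rest = m.group("rest")
    label_match = re.search(r"<(\w+)>\s*$", rest)
    label = label_match.group(1) if label_match else None
    if label_match:
        rest = rest[: label_match.start()]
    predicted = "#status predicted" in rest
    description = rest.replace("#status predicted", "").strip()
    return Feature(positions, m.group("kind"), description, predicted, label)


print(parse_feature("F;325-341/Domain: transmembrane #status predicted <TM1>"))
print(parse_feature("F;50,114,165,221/Binding site: carbohydrate (Asn) (covalent) #status predicted"))
```

Collecting the `Domain` features labelled TM1 to TM7 this way gives the predicted transmembrane segments as a list of residue ranges.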

Query Match 24.9%; Score 835.5; DB 1; Length 1049; 

Best Local Similarity 30.5%; Pred. No. 1.8e-50; 

Matches 222; Conservative 134; Mismatches 257; Indels 115; Gaps 18; 

Qy 1 MSSSNVEVFIPVSQGNTNGFPATASNDLKAFTEGAVLSFHNICYRVKLKSGFLPCRKPVE 6 0 

: I I : : I : II : I I I I I I I I I : II 
Db 355 LGSSKSPIRLP-DEDAVNNFLQNEDDTL ATLSFENITYSVPSINS DGVE 402 

Qy 61 KEILSNINGIMKPG-LNAILGPTGGGKSSLLDVLAARKDPSGLSGDVLINGAPRP-ANFK 118 

: : I : I : I I : I I I : I I : I : I I I : : I I I : I I : : : I I : : I I : I 

Db 403 ETVLNEISGIVKPGQILAIMGGSGAGKTTLLDILAMKRKTGHVSGSIKVNGISMDRKSFS 462 

Qy 119 CNSGYVVQDDVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVADSKV 178

I : I I I I : : I I I I I : I I I I I : : I I : : I : : I I : : I : 
Db 463 KIIGFVDQDDFLLPTLTVFETVLNSALLRLPKALSFEAKKARVYKVLEELRIIDIKDRII 522 

Qy 179 GTQFIRGVSGGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQ-GRT 237 

I : I I I : I I I I : : I I I I I : I I : I I I I I I I : I I I : I I I I : I I : I II 
Db 523 GNEFDRGISGGEKRRVSIACELVTSPLVLFLDEPTSGLDASNANNVIECLVRLSSDYNRT 582 

Qy 23 8 IIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLDI 297 

: : I I I I I I : I I I I I I I I : I : : : I I : : : : I I I I I I : : I I 

Db 583 LVLSIHQPRSNIFYLFDKLVLLSKGEMVYSGNAKKVSEFLRNEGYICPDNYNIADYLIDI 642 

Qy 298 INGDSTAV 305 

: I 

Db 643 TFEAGPQGKRRRIRNISDLEAGTDTNDIDNTIHQTTFTSSDGTTQREWAHLAAHRDEIRS 702 

Qy 306 ALNREEDFKATEIIEPSKQDKPLIEKLAEIYVNSSFYKETKAELHQ-LSGGEKKKK 360

I I I I : I : I 11:1 I : I : I I | : : | | | : : : 

Db 703 LLRDEEDVEGTDGRRGATEIDLNTKLLHDKYKDSVYYAELSQEIEEVLSEGDEESN 758

Qy 361 IT--VFKEISYTTSFCHQLRWVSKRSFKNLLGNPQASIAQIIVTVVLGLVIGAIYFGLKN 418

: : I II : : I I I I I : II: : : : | : : | | : | : | : : | 

Db 759 VLNGDLPTGQQSAGFLQQLSILNSRSFKNMYRNPKLLLGNYLLTILLSLFLGTLYYNVSN 818 

Qy 419 DSTGIQNRAGVLFFLTTNQCFSSVSAVELFVVEKKLFIHEYISGYYRVSSYFLGKLLSDL 478

I : I I I I I : I I : I I : : : I : I : : I I I : I I : | : : | : : | : : 

Db 819 DISGFQNRMGLFFFILTYFGFVTFTGLSSFALERIIFIKERSNNYYSPLAYYISKIMSEV 878 

Qy 479 LPMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSMALAIAAGQSVVSVAT 538

: I : : : I I : : I I I I I I I : I I I : I : : I II : : : 

Db 879 VPLRVVPPILLSLIVYPMTGLNMKDNAFFKCIGILILFNLGISLEILTIGIIFEDLNNSI 938 

Qy 539 LLMTICFVFMMIFSGLLV NLTTIASWLSWLQYFS IPRYGFTALQHNEF 586 

: I : : : : I I I I : I : I : I : I : I I : I : : I II 

Db 939 ILSVLVLLGSLLFSGLFINTKNITNVA--FKYLKNFSVFYYAYESLLINEVKTLMLKERK 996

Qy 587 LGQNF-CPGLNATGNNPCNYATCTGEEYLVKQGI--DLSPWGLWKNHVALACMIVIFLTI 643

I I I I II 11:1: 111:11: 
Db 997 YGLNIEVPG ATILSTFGFVVQNLVFDIK ILALFNWFLIM 1036 

Qy 644 AYLKLLFL 651 

III:: 
Db 1037 GYLALKWI 1044



RESULT 3 
FYFFW 

white protein - fruit fly (Drosophila melanogaster)
C;Species: Drosophila melanogaster
C;Date: 31-Dec-1990 #sequence_revision 17-Feb-1995 #text_change 09-Jul-2004
C;Accession: S08635; S07263; S10240
R;Pepling, M.; Mount, S.M.
Nucleic Acids Res. 18, 1633, 1990
A;Title: Sequence of a cDNA from the Drosophila melanogaster white gene.
A;Reference number: S08635; MUID:90221897; PMID:2109311
A;Accession: S08635
A;Molecule type: mRNA
A;Residues: 1-687 <PEP>
A;Cross-references: UNIPROT:P10090; UNIPARC:UPI000011F0A1; EMBL:X51749; NID:g8825;
PIDN:CAA36038.1; PID:g8826
R;O'Hare, K.; Murphy, C.; Levis, R.; Rubin, G.M.
J. Mol. Biol. 180, 437-455, 1984
A;Title: DNA sequence of the white locus of Drosophila melanogaster.
A;Reference number: S07263; MUID:85134865; PMID:6084717
A;Accession: S07263
A;Molecule type: DNA
A;Residues: 1-24,'LIFEIPYHCRVTAD',30-334,'ITLHLNSYPAWVPSVLPTTIRRTFTYRCWPLCPDGRSSPVIGSPRYG',372-687 <OHA1>
A;Cross-references: UNIPARC:UPI000016B571; EMBL:X02974
A;Experimental source: strain Canton S
R;O'Hare, K.
submitted to the EMBL Data Library, June 1985
A;Reference number: S10240
A;Accession: S10240
A;Molecule type: DNA
A;Residues: 1-24,'LIFEIPYHCRVTAD',30-687 <OHA2>
A;Cross-references: UNIPARC:UPI000016BDFF; EMBL:X02974; NID:g10873; PIDN:CAA26716.1;
PID:g10874
A;Experimental source: strain Canton S
C;Genetics:
A;Gene: white; w
A;Cross-references: FlyBase:FBgn0003996
A;Introns: 24/3; 116/1; 334/2; 439/3; 483/3
C;Superfamily: fruit fly white protein; ATP-binding cassette homology
C;Keywords: ATP; glycoprotein; nucleotide binding; P-loop; transmembrane protein
F;113-317/Domain: ATP-binding cassette homology <ABC>
F;130-137/Region: nucleotide-binding motif A (P-loop)
F;261-265/Region: nucleotide-binding motif B
F;67,93,472,554,651/Binding site: carbohydrate (Asn) (covalent) #status predicted

Query Match 24.2%; Score 812; DB 1; Length 687; 

Best Local Similarity 32.1%; Pred. No. 4.4e-49; 

Matches 210; Conservative 134; Mismatches 251; Indels 60; Gaps 17; 

Qy 5 NVEVFIPVSQGNTNGFPATASNDLKAFTEGAVLSFHNICYRVKLKSGFLPCRKPVEKEIL 6 4 

I : : : I I : I I : I II : : I I : I : I 
Db 74 NMDIFGAVNQ PGSGWRQLVNRTRGLFCNERHI PAPR KHLL 113 

Qy 65 SNINGIMKPG-LNAILGPTGGGKSSLLDVLAARKDPSGL----SGDVLINGAPRPA-NFK 118
1:1: I I I I : : I : I I I : : I I : I I I I I : II I : I I I I :

114 KNVCGVAYPGELLAVMGSSGAGKTTLLNALAFR-SPQGIQVSPSGMRLLNGQPVDAKEMQ 172 

Qy 119 CNSGYVVQDDVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVADSKV 178
I I I I I : : I : I I I I : I I I : I : : I : : I : : : I I I I I I I : : 

173 ARCAYVQQDDLFIGSLTAREHLIFQAMVRMPRHLTYRQRVARVDQVIQELSLSKCQHTII 232 

179 GTQ-FIRGVSGGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQGRT 23 7 
I : : I : I I I I I I I : I : I I I : I I I I I : I I I I I I : : I : : I I : : I : : I : I 

23 3 GVPGRVKGLSGGERKRLAFASEALTDPPLLICDEPTSGLDSFTAHSVVQVLKKLSQKGKT 2 92 

23 8 IIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLDI 297 

: I : I I I I : I : I I I : I : I I I : I I I I : : I I I | | | | | : : : 

293 VILTIHQPSSELFELFDKILLMAEGRVAFLGTPSEAVDFFSYVGAQCPTNYNPADFYVQV 3 52 

29 8 INGDSTAVALNREEDFKATEIIEPSKQDKPLIEKLAEIYVNSSFYKETKAELHQLSGGEK 357 
: II II:: : : : | : | | : | : : : | | : 

353 L AWPGREIESR DRIAKICDNFAISKVAR-DMEQLLATKN 3 91 

Qy 358 KKKITVFKEISYT--TSFCHQLRWVSKRSFKNLLGNPQASIAQIIVTVVLGLVIGAIYFG 415
: I I II : : III I I : : : I I : : | | : : : : | | | : | 

3 92 LEKPLEQPENGYTYKATWFMQFRAVLWRSWLSVLKEPLLVKVRLIQTTMVAILIGLIFLG 451 

Qy 416 LKNDSTGIQNRAGVLFFLTTNQCFSSVSA-VELFVVEKKLFIHEYISGYYRVSSYFLGKL 474

: 1:1 1:1 II I : I I : : I I : I : I I II : I I I I I 

452 QQLTQVGVMNINGAIFLFLTNMTFQNVFATINVFTSELPVFMREARSRLYRCDTYFLGKT 511 

Qy 475 LSDLLPMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSMALAIAAGQSVV 534

: : : I I : : : : I I I I I : I I : II : : : I I : : I I : I 

Db 512 IAE-LPLFLTVPLVFTAIAYPMIGLRAGVLHFFNCLALVTLVANVSTSFGYLISCASSST 570

53 5 SVATLLMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFTALQHNEFLGQNFCPG 5 94 
1:1 : : : : I I : I : : : I I I I I II I I : : II 

Db 571 SMALSVGPPVIIPFLLFGGFFLNSGSVPVYLKWLSYLSWFRYANEGLLINQWADVE--PG 628

595 -LNATGNNPCNYATCTGEEYLVKQGIDLSPWGLWKNHVALACMIVIFLTIAYLKL 648 
: : I : I II : : : : : I I : : | | | : | | | : I I I I 

629 EISCTSSN TTCPSSGKVILETLNFSAADLPLDYVGLAILIVSFRVLAYLAL 679 



RESULT 4 
T46101 

ABC transporter-like protein - Arabidopsis thaliana
N;Alternate names: protein T25B15.80
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 04-Feb-2000 #sequence_revision 04-Feb-2000 #text_change 09-Jul-2004
C;Accession: T46101
R;Alcaraz, J.P.; Clabault, G.; Cottet, A.; Mache, R.; Mewes, H.W.; Lemcke, K.; Mayer,
K.F.X.; Quetier, F.; Salanoubat, M.
submitted to the Protein Sequence Database, January 2000
A;Reference number: Z23021
A;Accession: T46101
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-737 <ALC>
A;Cross-references: UNIPROT:Q9FT51; UNIPARC:UPI00000A69DB; EMBL:AL132972
A;Experimental source: cultivar Columbia; BAC clone T25B15






SCORE Search Results Details for Application 09961086 and Search Result 20080917_142911_us-09-961-086a-l.rpr. 

C;Genetics:
A;Map position: 3 

A;Introns: 122/1; 146/3; 225/2; 277/2; 338/3; 422/2; 535/1; 628/3; 664/3 
A;Note: T25B15.80 

Query Match 23.9%; Score 800.5; DB 2; Length 737; 

Best Local Similarity 31.4%; Pred. No. 3.1e-48; 

Matches 211; Conservative 132; Mismatches 227; Indels 101; Gaps 21; 

Qy 27 DLKAFTEGAV LSFHNICYRVKLKSGFLPCRKPVEKEILSNINGIMKPG- 74 

I : : I I I 11:11:11 I I I I : I : I II 

Db 124 DIEAATSSVVKFQAEPTFPIYLKFIDITYKVTTKG MTSS SEKS ILNGI SGSAYPGE 179 



Qy 75 LNAILGPTGGGKSSLLDVLAARKDPSGLSGDVLINGAPRPANFKCNSGYVVQDDVVMGTL 134
I I : : I I : I I I : : I I : I I : : I I I I : I I : I I I I I : I 
180 LLALMGPSGSGKTTLLNALGGRFNQQNIGGSVSYNDKPYSKHLKTRIGFVTQDDVLFPHL 23 9 



Qy 135 TVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVADSKVGTQFIRGVSGGERKRT 194 

I I : I I : : I I I I I : I I I : I I I I I I I I : : I : : I I : I I I I I I I I I I 
Db 240 TVKETLTYTALLRLPKTLTEQEKEQRAASVIQELGLERCQDTMIGGSFVRGVSGGERKRV 299 



Qy 195 SIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQGRTIIFSIHQPRYSIFKLFD 254 

I I I : : I : I I : I I I I I I : I I I : I I : : : I : : I I : I I : : I I I I : I II 

Db 300 CIGNEIMTNPSLLLLDEPTSSLDSTTALKIVQMLHCIAKAGKTIVTTIHQPSSRLFHRFD 359 



255 SLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLDIINGDSTAVALNREEDFK 314 

I : I : I I : : I I I I : I I I I | | | : | | | : : | | : : : 
360 KLWLSRGSLLYFGKASEAMSYFSSIGCSPLLAMNPAEFLLDLVNGNMNDIS 411 



315 ATEIIEPSKQDKPLIEKLAEIYVNS SFYKETKAELHQLS GGE 356 

: : : : I I : I I : I I : II I : : : I 

412 VPSALKEKMKIIRL-ELYVRNVKCDVETQYLEEAYKTQIAVMEKMKLMAPVPLDEE 466 



Qy 357 KKKKITVFKE ISYTTSFC-HQLRWVSKRS FKNLLGNPQASIAQIIVTVVLGLVI 409 

I I I I : I : : I I I : : I II : I : : I : : : 
Db 467 VKLMITCPKREWGLSWWEQYCLLSLRGIKERRHDYFSWL RVTQVLSTAI IL 517 



Qy 410 GAIYFGLKNDSTGIQ-NRAGVLFFLTTNQCFSSV-SAVELFVVEKKLFIHEYISGYYRVS 467
I : : : : : I I : I : I : I I I : I I : I : I I : : I I I I : I 

Db 518 GLLWW--QSDITSQRPTRSGLLFFIAVFWGFFPVFTAIFTFPQERAMLSKERESNMYRLS 575



46 8 SYFLGKLLSDLLPMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSMALAI 52 7 
: I I : : Mil::: : : | : | | | | | | : : | : : | | : : | : : : | : | | | 

Db 576 AYFVARTTSD-LPLDLILPVLFLVVVYFMAGLRLRAESFFLSVLTVFLCIVAAQGLGLAI 634



Qy 528 AAGQSVVSVATLLMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFTAL---QHN 584

I : III:: : I : I I : : : : I : : : I : : I I : 

Db 635 GASLMDLKKATTLASVTVMTFMLAGGYFVK--KVPFFIAWIRFMSFNYHTYKLLVKVQYE 692



585 EFLGQNFCPGLNATGNNPCNYATCTGEEYLVKQGIDLSPWGLWKNHVALACMIVIFLTIA 6 44 

I : : I I I : : I : | | | | | : : : | 

693 EIM ESVNGEE — lESGL KEVSALVAMI IGYRLVA 724 



645 YLKLLFLKKYS 655 

I I : I : I 
725 YFSLRRMKLHS 735 
RESULT 5
C86441 

probable ABC transporter [imported] - Arabidopsis thaliana
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 09-Jul-2004
C;Accession: C86441
R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.; Alonso,
J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.; Chan, A.; Chao, Q.;
Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.; Conway, A.B.; Conway, A.R.;
Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.; Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii,
C.Y.; Gill, J.E.; Goldsmith, A.D.; Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L.
Nature 408, 816-820, 2000
A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.; Kim, C.J.;
Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.; Langin-Hooper, S.; Lee, A.;
Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu, S.X.; Liu, Z.A.; Luros, J.S.; Maiti,
R.; Marziali, A.; Militscher, J.; Miranda, M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.;
Pai, G.; Peterson, J.; Pham, P.K.; Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H.
A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.; Tallon,
L.J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken, S.; Vaysberg, M.;
Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.; Venter, J.C.; Davis, R.W.
A;Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis.
A;Reference number: A86141; MUID:21016719; PMID:11130712
A;Accession: C86441
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-646 <STO>
A;Cross-references: UNIPROT:Q9C6R7; UNIPARC:UPI00000AA9CB; GB:AE005172; NID:g11136734;
PIDN:AAG31315.1; GSPDB:GN00141
C;Genetics:
A;Map position: 1
C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1;
ATP-binding cassette homology

Query Match 23.1%; Score 774; DB 2; Length 646; 

Best Local Similarity 33.5%; Pred. No. 1.9e-46; 

Matches 213; Conservative 119; Mismatches 234; Indels 70; Gaps 22; 

Qy 37 LSFHNICYRVKLK--SGFLPCRKPVEKEILSNINGIMKPG-LNAILGPTGGGKSSLLDVL 93

: : : I : I I : : I : I I I I I : I I : : I I | : | | | : | | | : : | | | 

Db 51 ITLKEVVYKVKIEQTSQCMGSWKSKEKTILNGITGMVCPGEFLAMLGPSGSGKTTLLSAL 110

Qy 94 AARKDPSGLSGDVLINGAPRPANFKCNSGYVVQDDVVMGTLTVRENLQFSAALRLATTMT 153

I : 111:111 I : I : I I I I I : | | | | | | : | | | | : : : | 

Db 111 GGRLSKT-FSGKVMYNGQPFSGCIKRRTGFVAQDDVLYPHLTVWETLFFTALLRLPSSLT 169 

Qy 154 NHEKNERINRVIQELGLDKVADSKVGTQFIRGVSGGERKRTSIGMELITDPSILFLDEPT 213 

II I : : I I I MM:: : | : | II : I II I : I I III I : : : I I : I I II II 

Db 170 RDEKAEHVDRVIAELGLNRCTNSMIGGPLFRGISGGEKKRVSIGQEMLINPSLLLLDEPT 229 

Qy 214 TGLDSSTANAVLLLLKRMSKQGRTIIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEA 273 

: I I I I : I I : : : : I I : : I II : : : II II | : : || : | | : | : : : I I I 

Db 230 SGLDSTTAHRIVTTIKRLASGGRTVVTTIHQPSSRIYHMFDKVVLLSEGSPIYYGAASSA 289

Qy 274 LGYFESAGYHCEAYNNPADFFLDIING--DSTAVALNREEDFKATEIIEPSKQDKPLIEK 331

: I I I I : I I I I 11:11 I : : I I : I : I : I 

Db 290 VEYFSSLGFSTSLTVNPADLLLDLANGIPPDTQKETSEQEQKTVKETL-VSAYEKNISTK 348 
Qy 332 L-AEI-YVNSSFYKETKAELHQLSGGEKKKKITVFKEISYTTSFCHQL RWVSKRS 384

III: 11:111 I I : I : : : I I I : I 

349 LKAELCNAESHSYEYTKAAAKNL KSEQWCTTWWYQFTVLLQRGVRERR 3 96 



3 85 FKNLLGNPQASIAQIIVTVVLGLVIGAIYFGLKNDSTGIQNRAGVLFFLTTNQCFSSV-S 4 43 
I : : : I I : I II I : : : : I I : I : I I I : I : : 

397 FESF NKLRIFQVISVAFLG GLLWW — HTPKSHIQDRTALLFFFSVFWGFYPLYN 448 



Qy 444 AVELFVVEKKLFIHEYISGYYRVSSYFLGKLLSDLLPMTMLPSIIFTCIVYFMLGLKPKA 503

II I I I : : I I I I I I : I I I I : : : I I I : : I I : I : I I I I I 

44 9 AVFTFPQEKRMLIKERSSGMYRLSSYFMARNVGD-LPLELALPTAFVFIIYWMGGLKPDP 507 



Qy 504 DAFFVMMFTLMMVAYS ASSMALAIAAGQSVVSVATLLMTICFVFMMIFSGLLVNLTT 560

I : : I : : I I I I : I I I : III:: : : I I I 

50 8 TTF ILSLLWLYSVLVAQGLGLAFGALLMNIKQATTLASVTTLVFLIAGGYYVQ — Q 562 



Qy 561 IASWLSWLQYFSIPRYGFTALQHNEFLGQNFCPGLNATGNNPCNYATCT GEEY 613

I : : I I : I I I : I I : I : : I I : I : 

563 IPPFIVWLKYLSYSYYCYKLLL GIQYTDD DYYECSKGVWCRVGDFP 608 



614 LVK-QGIDLSPWGLWKNHVALACMIVIFLTIAYLKL 648 
: I I : : II: : I : I : : I I : I 

609 AIKSMGLN NLWIDVFVMGVMLVGYRLMAYMAL 640 



RESULT 6 
D96553 

hypothetical protein F5D21.6 [imported] - Arabidopsis thaliana
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 09-Jul-2004
C;Accession: D96553
R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.; Alonso,
J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.; Chan, A.; Chao, Q.;
Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.; Conway, A.B.; Conway, A.R.;
Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.; Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii,
C.Y.; Gill, J.E.; Goldsmith, A.D.; Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L.
Nature 408, 816-820, 2000
A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.; Kim, C.J.;
Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.; Langin-Hooper, S.; Lee, A.;
Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu, S.X.; Liu, Z.A.; Luros, J.S.; Maiti,
R.; Marziali, A.; Militscher, J.; Miranda, M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.;
Pai, G.; Peterson, J.; Pham, P.K.; Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H.
A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.; Tallon,
L.J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken, S.; Vaysberg, M.;
Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.; Venter, J.C.; Davis, R.W.
A;Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis.
A;Reference number: A86141; MUID:21016719; PMID:11130712
A;Accession: D96553
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-687 <STO>
A;Cross-references: UNIPROT:Q9C8K2; UNIPARC:UPI00000AB8E7; GB:AE005173; NID:g10092349;
PIDN:AAG12758.1; GSPDB:GN00141
C;Genetics:
A;Gene: F5D21.6
A;Map position: 1
C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1;
ATP-binding cassette homology

Query Match 22.9%; Score 767.5; DB 2; Length 687; 

Best Local Similarity 32.6%; Pred. No. 5.9e-46; 

Matches 212; Conservative 118; Mismatches 244; Indels 77; Gaps 21; 

Qy 3 4 GAVLSFHNICYRVKLKSGFLPCRKPVEKEILSNINGIMKPG-LNAILGPTGGGKSSLLDV 92 

III:::: : II II: : I : I I : I I : I I : I I : I I I I : I I I 

Db 21 GAYLAWEDLTVVIPNFSGG-PTRR-----LLDGLNGHAEPGRIMAIMGPSGSGKSTLLDS 74



Qy 93 LAARKDPSG-LSGDVLINGAPRPANFKCNSGYVVQDDVVMGTLTVRENLQFSAALRLATT 151
III : : : I : : I : I I : : | | | : | : : | | | | | | | | : : | | | | | : : 

75 LAGRLARNVIMTGNLLLNGKKARLDYGL-VAYVTQEDILMGTLTVRETITYSAHLRLSSD 133 



152 MTNHEKNERINRVIQELGLDKVADSKVGTQFIRGVSGGERKRTSIGMELITDPSILFLDE 211 

: I II:: I I I I I II : I I I I I I I I I I I I :: I :: I I I I I I I I 

13 4 LTKEEVNDIVEGTIIELGLQDCADRVIGNWHSRGVSGGERKRVSVALEILTRPQILFLDE 193 



Qy 212 PTTGLDSSTANAVLLLLKRMSKQ-GRTIIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPA 270 

I I : I I I I : : I I : I : : : : I I I :: I I I I I : I I I I I I I : I I : : I : 

Db 194 PTSGLDSASAFFVIQALRNIARDGGRTVVSSIHQPSSEVFALFDDLFLLSSGETVYFGES 253 



Qy 271 QEALGYFESAGYHCEAYNNPADFFLDIINGDSTAVALNREEDFKATEIIEPSKQDKPLIE 330

: I : : I I I : I I I : I I I III I : : I I : III: 

Db 254 KFAVEFFAEAGFPCPKKRNPSDHFLRCINSDFDTVTATLKGSQRIRE--TPATSD-PLMN 310



Qy 331 KLAEIYVNSSFYKETKAELHQLSGGE KKKKITVFKEISYTTSFC 374 

: I I I 1:1 I : : : I : I I : I I I 
Db 311 LATSEIKARLVENYRRSVYAKSAKSRIRELASIEGHHGMEVRKGSEATWFK 361 



Qy 375 HQLRWVSKRSFKNLLGNPQASIAQIIVTVVLGLVIGAIYFGLKNDSTGIQNRAGVLFFLT 434
I I I : : I I I I I : : : : I : : : I : : | | : : : : | | | | : | 

36 2 -QLRTLTKRSFVNMCRDIGYYWSRIVIYIVVSFCVGTIFYDVGHSYTSILARVSCGGFIT 420 



43 5 TNQCFSSVSAVELFVVEKKLFIHEYISGYYRVSSYFLGKLLSDLLPMTMLPSIIFTCIVY 4 94 

II: I : I I : I I : I I I I I I I : : I I : : : I II 

421 GFMTFMSIGGFPSFIEEMKVFYKERLSGYYGVSVYIISNYVSS-FPFLVAIALITGSITY 4 79 



Qy 495 FMLGLKPKAD--AFF---------VMMFTLMMVAYSASSMALAIAAGQSVVSVATLLMTI 543

I : : I III I : : I : I I : : : | : : : : : | | 

Db 480 NMVKFRPGVSHWAFFCLNIFFSVSVIESLMMVVASLVPNFLMGLITGAGIIGI--IMMTS 537



54 4 CFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFTALQ HNEFLGQNFCPGLNATGN 6 00 

I I I : I : I I 111:1 I : I I I I I 
53 8 GFF RLLPDLPKV FWRYPI SFMSYGSWAIQGAYKNDFLGLEFDPMF 5 82 



601 NPCNYATCTGEEYLVK-QGIDLSP WGLWKNHVALACMIVIFLTIAYLK 647 

I I I : : I I : : : II : I I : : I : II 

583 — AGEPKMTGEQVINKIFGVQVTHSKWWDLSAIVLILVCYRILFFIVLKLK 631 



RESULT 7 
A84509 

probable ABC transporter [imported] - Arabidopsis thaliana
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 02-Feb-2001 #sequence_revision 02-Feb-2001 #text_change 09-Jul-2004
C;Accession: A84509
R;Lin, X.; Kaul, S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, C.D.; Fujii, C.Y.;
Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V.; Buell, C.R.; Ketchum, K.A.;
Lee, J.J.; Ronning, C.M.; Koo, H.; Moffat, K.S.; Cronin, L.A.; Shen, M.; VanAken, S.E.;
Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams, M.D.; Carrera, A.J.; Creasy, T.H.; Goodman,
H.M.; Somerville, C.R.; Copenhaver, G.P.; Preuss, D.; Nierman, W.C.; White, O.; Eisen, J.A.;
Salzberg, S.L.; Fraser, C.M.; Venter, J.C.
Nature 402, 761-768, 1999
A;Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana.
A;Reference number: A84420; MUID:20083487; PMID:10617197
A;Accession: A84509
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-649 <STO>
A;Cross-references: UNIPROT:Q9SIT6; UNIPARC:UPI00000A95A1; GB:AE002093; NID:g4558665;
PIDN:AAD22683.1; GSPDB:GN00139
C;Genetics:
A;Gene: At2g13610
A;Map position: 2
C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1;
ATP-binding cassette homology

Query Match 22.3%; Score 746; DB 2; Length 649; 

Best Local Similarity 33.5%; Pred. No. 1.8e-44; 

Matches 203; Conservative 116; Mismatches 239; Indels 48; Gaps 18; 

Qy 61 KEILSNINGIMKP-GLNAILGPTGGGKSSLLDVLAARKDPSGLSGDVLINGAP-RPANFK 118 

1:1 : II : I I : I I : I I I I I I I :: I I I I I : I I : I I I I I I 

Db 60 KHVLKGVTCRAKPWEILAIVGPSGAGKSSLLEILAARLIPQ--TGSVYVNKRPVDRANFK 117

Qy 119 CNSGYVVQDDVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVADSKV 178

I I I I I I : I I I I I I I I I I I : I I : : : I I I I : I I : : I 
Db 118 KISGYVTQKDTLFPLLTVEETLLFSAKLRL--KLPADELRSRVKSLVHELGLEAVATARV 175

Qy 179 GTQFIRGVSGGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSK-QGRT 237 

I : I I : I I I I I : I I I I : I : I II : I I I I I I : I I I I :: I : : : I I I : : : I I I 

Db 176 GDDSVRGISGGERRRVSIGVEVIHDPKVLILDEPTSGLDSTSALLIIDMLKHMAETRGRT 235 

Qy 23 8 IIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLDI 297 

I I : I I I I : I I I : I : I I I : I : I : I III : I : I : : 
Db 236 IILTIHQPGFRIVKQFNSVLLLANGSTLKQGSVDQLGVYLRSNGLHPPLHENIVEFAIES 295 

Qy 298 INGDSTAVALNREEDFKATEIIEPSKQDKPLIEKLAEIYVNSSFYKETKAELHQLSGGEK 357 

1:1 : I : I : : I I I I : I I I I I I I : 

Db 296 I--ESITKQQRLQESRRAAHVLTP---QTTLQEKRSEDSQGES--KSGKFTLQQLFQQTR 348

Qy 358 KKKI-TVFKEISYTTSFCH QLRWVSKRSFKNLLGNPQAS IAQIIVTVVLGLVIGAI 412

: I : : I I : : : : I I I : : : : : | : | : | | 

Db 349 VADVGTMNIATEFTRDFANSRLEETMILTHRFSKNIFRTKELFACRTVQMLGSGIVLGLI 408 

Qy 413 YFGLKNDSTGIQNRAGVLFFLTTNQCFSSVSAVELFVVEKKLFIHEYISGYYRVSSYFLG 472 

: 11:1 I : I I : I : I | : : | : : | : | : : : : | I I I I I I I I : 

Db 409 FHNLKDDLKGARERVGLFAFILTFLLTSTIEALPIFLQEREILMKETSSGSYRVSSYAVA 468 

Qy 473 KLLSDLLPMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSMALAIAA 529 

I II : : : I : I : I I : : : I I I II : : : | : | : | : : : | 

Db 469 NGLV-YLPFLLILAILFSTPVYWLVGLNPSFMAFLHFSLLIWLILYTANSVWCFSALVP 527 
Qy 530 GQSVVSVATLLMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFTALQHNE 585

111:1 : I I : I I I : : I : : : I I : : I I II 

Db 528 NFIVGNSVISG VMGSFF LFSGYFISNHEIPGYWIFMHYISLFKYPFEGFLINE 580 

Qy 586 FLGQNFCPGLNATGNNPCNYATCTGEEYLVKQGIDLSPWG LWKNHVALACMIVIFLT 6 42 

I II I I I I I : I : : I | : | | : | : : : : 

Db 581 FSKSNKC LEYGFGKC LVTEEDLLKE ERYGEESRWRNWIMLCFVLLYRF 629 

Qy 643 IAYLKL 648

1:1: I 

Db 630 ISYVIL 635 

RESULT 8 
T47652 

ABC transporter-like protein - Arabidopsis thaliana
N;Alternate names: protein T26I12.10
C;Species: Arabidopsis thaliana (mouse-ear cress)
C;Date: 20-Apr-2000 #sequence_revision 20-Apr-2000 #text_change 09-Jul-2004
C;Accession: T47652
R;Monfort, A.; Casacuberta, E.; Puigdomenech, P.; Mewes, H.W.; Lemcke, K.; Mayer, K.F.X.;
Quetier, F.; Salanoubat, M.
submitted to the Protein Sequence Database, February 2000
A;Reference number: Z24471
A;Accession: T47652
A;Status: preliminary
A;Molecule type: DNA
A;Residues: 1-725 <MON>
A;Cross-references: UNIPROT:Q9M3D6; UNIPARC:UPI00000A7A62; EMBL:AL132954
A;Experimental source: cultivar Columbia; BAC clone T26I12
C;Genetics:
A;Map position: 3
A;Note: T26I12.10
C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1;
ATP-binding cassette homology

Query Match 22.2%; Score 745; DB 2; Length 725; 

Best Local Similarity 29.3%; Pred. No. 2.4e-44; 

Matches 193; Conservative 130; Mismatches 274; Indels 62; Gaps 13; 

Qy 36 VLSFHNICYRVKLKSGFLPCRKPVEKEILSNINGIMKPG-LNAILGPTGGGKSSLLDVLA 94 

I I : I : I : I I I : I I : I : I : : : I | : | : | | : | | | | : | : | | | 

Db 72 VLNFNNLQYDVTLRRRFGFSRQNGVKTLLDDVSGEASDGDILAVLGASGAGKSTLIDALA 131 

Qy 95 ARKDPSGLSGDVLINGAP--RPANFKCNSGYVVQDDVVMGTLTVRENLQFSAALRLATTM 152

I 111:11 : I 111:111:: | | | : | | | : : | | : : 

Db 132 GRVAEGSLRGSVTLNGEKVLQSRLLKVISAYVMQDDLLFPMLTVKETLMFASEFRLPRSL 191 

Qy 153 TNHEKNERINRVIQELGLDKVADSKVGTQFIRGVSGGERKRTSIGMELITDPSILFLDEP 212 

: : I I I : : I : I I I I : : : I : I I I I I I I I : I I I I ::: I I I : I I I I I I 
Db 192 SKSKKMERVEALIDQLGLRNAANTVIGDEGHRGVSGGERRRVSIGIDIIHDPIVLFLDEP 251 

Qy 213 TTGLDSSTANAVLLLLKRMSKQGRTIIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQE 272 

I : I I I I : I I : : I I I : : : I : I I I I I I | : | | | : | : | : : | : | 

Db 252 TSGLDSTNAFMWQVLKRIAQSGSIVIMSIHQPSARIVELLDRLIILSRGKSVFNGSPAS 311 
Qy 273 ALGYFESAGYHCEAYNNPADFFLDII NGDSTAVALNREEDFKATEI IEPS 322

1:1 I I : : I I I : : I II: : I : : 

Db 312 LPGFFSDFGRPIPEKENISEFALDLVRELEGSNEGTKALVDFNEKWQQNKISLIQSAPQT 371



Qy 323 KQDKPLIEKLAEIYVNSSFYKETKAELHQLSGGEKKKKITVFKEISYTTSFCHQLRW 379

11:111 : I : I : : : I : I I : : : II : 

Db 372 NKLDQDRSLSLKEA INASV SRGKL--VSGSSRSNPTSMETVSSYANPSLFETFI 423



Qy 380 VSKRSFKNLLGNPQASIAQIIVTVVLGLVIGAIYFGLKNDSTGIQNRAGVLFFLTTNQCF 439 

: : I I II: I : : I : I I : : : I : I : III : I : : 

Db 424 LAKRYMKNWIRMPELVGTRIATVMVTGCLLATVYWKLDHTPRGAQERLTLFAFVVPTMFY 483



Qy 440 SSVSAVELFVVEKKLFIHEYISGYYRVSSYFLGKLLSDLLPMTMLPSIIFTCIVYFMLGL 499
: I : I : I : : I : I Mill: I II : I I : : I : I : : : I I 

Db 484 CCLDNVPVFIQERYIFLRETTHNAYRTSSYVISHSLVS-LPQLLAPSLVFSAITFWTVGL 542



Qy 500 KPKADAFFVMMFTLMMVAYSASSMALAIAAGQSVVSVATLLMTICFVFMMIFSGLLVNLT 559
: I : : I I I : I : : : : : : : : | | | | 

Db 543 SGGLEGFVFYCLLIYASFWSGSSVVTFISGWPNIMLCYMVSITYLAYCLLLSGFYVNRD 602



Qy 560 TIASWLSWLQYFSIPRYGFTALQHNEFLGQNFC PGLNATG 599

I : : I I I I : I : I : I I I : I I : : : I 

Db 603 RIPFYWTWFHYISILKYPYEAVLINEFDDPSRCFVRGVQVFDSTLLGGVSDSGKVKLLET 662



Qy 600 NNPCNYATC--TGEEYLVKQGI-DLSPWGLWKNHVALACMIVIFLTIAYLKLLF 650

: I I I I : I : I I I III I : : I : : : : I I 

Db 663 LSKSLRTKITESTCLRTGSDLLAQQGITQLSKWD CLWITFASGLFFRILF 712



RESULT 9 
T45891 

ABC transporter-like protein - Arabidopsis thaliana 

N; Alternate names: protein F4P12.210 

C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 04-Feb-2000 #sequence_revision 04-Feb-2000 #text_change 09-Jul-2004 
C;Accession: T45891 

R;Bloecker, H.; Mewes, H.W.; Lemcke, K.; Mayer, K.F.X.; Quetier, F.; Salanoubat, M. 

submitted to the Protein Sequence Database, January 2000 

A;Reference number: Z23016 

A;Accession: T45891 

A; Status: preliminary 

A;Molecule type: DNA 

A;Residues: 1-739 <BLO> 

A;Cross-references : UNIPROT : Q9LFG8 ; UNIPARC : UPI00000A86EE; EMBL : AL132966 
A; Experimental source: cultivar Columbia; BAC clone F4P12 
C; Genetics : 
A;Map position: 3 
A;Note: F4P12.210 

C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1; ATP-binding cassette homology

Query Match 22.1%; Score 741.5; DB 2; Length 739; 

Best Local Similarity 28.3%; Pred. No. 4.4e-44; 

Matches 193; Conservative 123; Mismatches 256; Indels 111; Gaps 17; 

Qy 36 VLSFHNICYRVKLKSGF--LPC--RKPVE KEILSNINGIMKPG-LNAILGPT 82

I I I I : : I I I : I I II I : I : I : I : I : | : | : | | : 
Db 87 VLSFKDLTYSVKIKKKFKPFPCCGNSPFDGNDMEMNTKVLLNGISGEAREGEMMAVLGAS 146



Qy 83 GGGKSSLLDVLAARKDPSGLSGDVLINGAPRPANF-KCNSGYVVQDDVVMGTLTVRENLQ 141
1111:1:1111 I I I : : I I : : I | | | : | | | : : I I I I I 

Db 147 GSGKSTLIDALANRISKESLRGDITLNGEVLESSLHKVISAYVMQDDLLFPMLTVEETLM 206



Qy 142 FSAALRLATTMTNHEKNERINRVIQELGLDKVADSKVGTQFIRGVSGGERKRTSIGMELI 201 

III II:::: : I I : : I : I I I I : : I : I I I I I I I I : I I I I :: I 

Db 207 FSAEFRLPSSLSKKKKKARVQALIDQLGLRNAAKTVIGDEGHRGVSGGERRRVSIGTDII 266 



Qy 202 TDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQGRTIIFSIHQPRYSIFKLFDSLTLLAS 261 

I I I I I I I I I I : I I I I :: I I : : I : I : : : I : I I I I I I I I III I : 
Db 267 HDPIILFLDEPTSGLDSTSAYMWKVLQRIAQSGSIVIMSIHQPSYRILGLLDKLIFLSR 326 



Qy 262 GRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLDIINGDSTAVALNREEDFKATEIIEP 321

I : : I : I I : | : | | | : | I : : 

Db 327 GNTVYSGSPTHLPQFFSEFGHPIPENENKPEFALDLIR ELEDS 369



Qy 322 SKQDKPLIEKLAEIYVNSSFYKETKAELHQLSGGEKKKKITVFKEI 367

: I I : I I : I : : I : I I : : | : 

Db 370 PEGTKSLVE FHKQWRAK--QTSSQSRRNTNVSLKDAISASISRGKLVSGA 417



Qy 368 SYTTSFCHQLRWVSKRSFKNLLGNPQASIAQIIVTVVLGLVIGAIYFGLKND 419

: : I : : : I I I I I : : : : I I : : : I : : I I 

Db 418 TNLRSSFQTFANPFWTEMLVIGKRSILNSRRQPELFGIRLGAVLVTGMILATIFWKLDNS 477



Qy 420 STGIQNRAGVLFFLTTNQCFSSVSAVELFVVEKKLFIHEYISGYYRVSSYFLGKLLSDLL 479

I I I I I I : : : I : : I : I : : I : I I I I I I I : : 

Db 478 PRGIQERLGFFAFAMSTTFYTCAEAIPVFLQERYIFMRETAYNAYRRSSYVLAHTIIS-I 536



Qy 480 PMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSMALAIAAGQSVVSVATL 539
I : : I I : : I I : : I II:: : : I I : : II: 

Db 537 PALIILSAAFAASTFSAVGLAGGSEGFLFFFFTILTAFWAGSSFVTFLSGVVSHVMIGFT 596



Qy 540 LMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFTALQHNEFLGQNFC--PGLNA 597
: : : : : I I I : : I : I I I : : I : : III I I : 

Db 597 WVAILAYFLLFSGFFISRDRIPLYWIWFHYLSLVKYPYEGVLQNEFEDPTKCFVRGIQM 656



Qy 598 TGNNPCNYA TC--TGEEYLVKQGI-DLSPWG-LWKNH 630

1:1 I I I I : I : I I I :: I I I I 
Db 657 FDNSPLGQVPTAVKISLLKSMSGVLGINVTAETCVTTGIDILKQQGITEISKWNCLW 713



Qy 631 VALACMIVIFLTIA YLKLLF 650

: I : I : : : I I 

Db 714 ITVAWGFFFRVLF 726



RESULT 10 
H96552 

hypothetical protein F5D21.8 [imported] - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 09-Jul-2004 
C;Accession: H96552 

R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.; Alon
J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.; Chan, A.; Chao, Q
Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.; Conway, A.B.; Conway, A.R.; 
Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.; Feldblyum, T.V.; Feng, J.; Fong, B.; Fuj 
Y.; Gill, J.E.; Goldsmith, A.D.; Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L.
Nature 408, 816-820, 2000 

A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.; Kim, C.J.; 
Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.; Langin-Hooper, S.; Lee, A.;
Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu, S.X.; Liu, Z.A.; Luros, J.S.; Maiti,
R.; Marziali, A.; Militscher, J.; Miranda, M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.;
Pai, G.; Peterson, J.; Pham, P.K.; Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H.
A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.; Tallon, L.
J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken, S.; Vaysberg, M.;
Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.; Venter, J.C.; Davis, R.W.
A; Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis. 
A;Reference number: A86141; MUID:21016719; PMID:11130712
A;Accession: H96552 
A; Status: preliminary 
A;Molecule type: DNA 
A;Residues: 1-678 <STO> 

A;Cross-references: UNIPROT:Q9C8J8; UNIPARC:UPI00000A045D; GB:AE005173; NID:g10092361; PIDN:AAG12770.1; GSPDB:GN00141
C; Genetics : 
A;Gene: F5D21.8
A;Map position: 1 

C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1; ATP-binding cassette homology

Query Match 22.0%; Score 739; DB 2; Length 678; 

Best Local Similarity 30.3%; Pred. No. 5.8e-44; 

Matches 186; Conservative 137; Mismatches 248; Indels 42; Gaps 13; 

Qy 61 KEILSNINGIMKPG-LNAILGPTGGGKSSLLDVLAARKDPSG LSGDVLINGAPRPAN 116 

I : I : : I I : I : I I : I I : I I I I : I I I I I I : I : I I I I : I I I : 
Db 28 KRLLNGVNGCGEPNRILAIMGPSGSGKSTLLDALAGRL--AGNWMSGKVLVNGKKRRLD 85

Qy 117 FKCNSGYVVQDDVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVADS 176

I : I I I : I I : : I I I I I I I : : : I I I I I : : I I : : : | : : | | : : : | 

Db 86 EGA-AAYVTQEDVLLGTLTVRESISYSAHLRLPSKLTREEISDIVEATITDMGLEECSDR 144 

Qy 177 KVGTQFIRGVSGGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQGR 236

: I : I I : I I I I : I I II : I : : I I I : I I I I I I I : I I I I :: I I : : I : : : I : 

Db 145 TIGNWHLRGISGGEKKRLSIALEVLTKPSLLELDEPTSGLDSASAEEWQILRNIASSGK 204 

Qy 237 TIIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLD 296

I :: I I I I I : I I I I I I I : I : : I I : I : I I I : I : I I : I I I 
Db 205 TVVSSIHQPSGEVEALEDDLLLLSGGETVYEGEAESATKEEGEAGEPCPSRRNPSDHELR 264 

Qy 297 IINGD STAVALNR EEDFKATEIIEPSKQDKPLIEKLAEIYVNSSFYKETKAEL 349

: I I : I : : I : I : : I : II:: : : : : : I I 

Db 265 CVNSDEDNVTAALVESRRINDSSESLHQLHETTNTLDPL-DDIPTAEIRTTLVRKEKCSL 323 

Qy 350 HQLSGGEKKKKITVF KEISYTTSFCHQLRWVSKRSFKNLLGNPQASIAQIIVTV 403

: : : : : I : : I : : I I I : : : I I I I : : : I I : 

Db 324 YAAASRARIQEIASIVGIVTERKKGSQTNWWKQLRILTQRSEINMSRDLGYYWMRIAVYI 383 

Qy 404 VLGLVIGAIYFGLKNDSTGIQNRAGVLFFLTTNQCFSSVSAVELFVVEKKLFIHEYISGY 463

I I : : I : I : I : : I : : I I : II: : | : | | : | I : : I : 

Db 384 VLSICVGSIEENVGRNHTNVMSTAACGGEMAGEMTEMSIGGEQSEIEEMKVESRERLNGH 443 

Qy 464 YRVSSYFLGKLLSDLLPMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSM 523
Db 444 YGVAVYTVSNLLSS-LPFIILMCLSTSSITIYMVRFQSGGSHFFYNCLDLICAITTVESC 502



Qy 524 ALAIAAGQSVVSVATLLMTIC FVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFT 579

: I I : I II: : : : I : I : I I II 

Db 503 MMMIAS WPNFLMGVMLGAGYIGIMVLSAGFFRFFPDLPMVFWRYPVSYINYGAW 557



Qy 580 ALQ HNEFLGQNFCPGLNATGNNPCNYATCTGEEYLVKQGIDLSPWGLWKNHVALACM 636

III I I : I : : : I | :::::: | : I : I 

Db 558 ALQGAYKNEMIGVEY DSPLPLVPKMKGELILQTVLGINPESSKWLDLAWMM 609



Qy 637 IVIFLTIAYLKLL 649

1:1 II: : I 
Db 610 ILIGYRIAFFAIL 622



RESULT 11 
T47650 

ABC transporter-like protein - Arabidopsis thaliana 

N; Alternate names: protein T15C9.110 

C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 20-Apr-2000 #sequence_revision 20-Apr-2000 #text_change 09-Jul-2004 
C;Accession: T47650 

R;Mewes, H.W.; Rudd, S.; Lemcke, K.; Mayer, K.F.X. 

submitted to the Protein Sequence Database, April 2000 

A;Reference number: Z24470 

A;Accession: T47650 

A; Status: preliminary 

A;Molecule type: DNA 

A;Residues: 1-708 <MEW> 

A;Cross-references: UNIPROT:Q9M2V5; UNIPARC:UPI00000A627B; EMBL:AL132970
A; Experimental source: cultivar Columbia; BAC clone T15C9 
C ; Genetics : 
A;Map position: 3 
A;Note: T15C9.110 

C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1; ATP-binding cassette homology

Query Match 21.7%; Score 726.5; DB 2; Length 708; 

Best Local Similarity 30.1%; Pred. No. 4.7e-43; 

Matches 198; Conservative 123; Mismatches 282; Indels 55; Gaps 15; 

Qy 36 VLSFHNICYRVKLKSGFLPCRKPVE--KEILSNINGIMKPG-LNAILGPTGGGKSSLLDV 92

: I I I : I : I I I : I I : I : I : I I : | : | : | | : | | | | : | : | 

Db 62 LLSFNNLSYNVVLRRRFDFSRRKTASVKTLLDDITGEARDGEILAVLGGSGAGKSTLIDA 121 



Qy 93 LAARKDPSGLSGDVLINGAP--RPANFKCNSGYVVQDDVVMGTLTVRENLQFSAALRLAT 150
III 111:11 : I 111:111:: | | | : | | | : : | | 

Db 122 LAGRVAEDSLKGTVTLNGEKVLQSRLLKVISAYVMQDDLLFPMLTVKETLMFASEFRLPR 181



Qy 151 TMTNHEKNERINRVIQELGLDKVADSKVGTQFIRGVSGGERKRTSIGMELITDPSILFLD 210
: : : I I I : : I : I I I I I : : I : I I I I I I I I : I I I I ::: I I I : I I I I 

Db 182 SLPKSKKMERVETLIDQLGLRNAADTVIGDEGHRGVSGGERRRVSIGIDIIHDPILLFLD 241



Qy 211 EPTTGLDSSTANAVLLLLKRMSKQGRTIIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPA 270
I I I : I I I I : I I : : I I I : : : I : I I I I I I I | | | : | : | : : | : | 

Db 242 EPTSGLDSTNAFMWQVLKRIAQSGSWIMSIHQPSARIIGLLDRLIILSHGKSVFNGSP 301
Qy 271 QEALGYFESAGYHCEAYNNPADFFLDII NGDSTAVALNREEDFKATEI IEPSKQD 325

: I I I I : I I I : I : : | : I : : : : I 

Db 302 VSLPSFFSSFGRPIPEKENITEFALDVIRELEGSSEGTRDLVEFNEKWQQNQTARATTQS 361

Qy 326 KPLIEKLAEIYVNSSFYKETKAELHQLSGGEKKKKITVFKEISYTTSFCHQLRWVSKRSF 385
: : : : I : : : | : | | | : : | | : : : | | 

Db 362 RVSLKEAIAASV SRGKL--VSGSSGANPISMETVSSYANPPLAETFILAKRYI 412

Qy 386 KNLLGNPQASIAQIIVTVVLGLVIGAIYFGLKNDSTGIQNRAGVLFFLTTNQCFSSVSAV 445

I I : I : : I : I I I : : : I : I I I I I I I : : : 

Db 413 KNWIRTPELIGMRIGTVMVTGLLLATVYWRLDNTPRGAQERMGFFAFGMSTMFYCCADNI 472

Qy 446 ELFVVEKKLFIHEYISGYYRVSSYFLGKLLSDLLPMTMLPSIIFTCIVYFMLGLKPKADA 505
: I : I : : I : I Mill: I II : III : : : I I : : 

Db 473 PVFIQERYIFLRETTHNAYRTSSYVISHALVS-LPQLLALSIAFAATTFWTVGLSGGLES 531

Qy 506 FFVMMFTLMMVAYSASSMALAIAAGQSVVSVATLLMTICFV-FMMIFSGLLVNLTTIASW 564

II : : I I I : I : I : : : : | | : : : : : | : | | : 

Db 532 FFYYCLIIYAAFWSGSSIVTFI-SGLIPNVMMSYMVTIAYLSYCLLLGGFYINRDRIPLY 590

Qy 565 LSWLQYFSIPRYGFTALQHNEFLGQNFC PGLNATG 599

I I I : : I : I : I I I : I I : : 

Db 591 WIWFHYISLLKYPYEAVLINEFDDPSRCFVKGVQVFDGTLLAEVSHVMKVKLLDTLSGSL 650

Qy 600 NNPCNYATC--TGEEYLVKQGI-DLSPWG-LWKNHVALACMIVIFLTIAYLKLLFLKK 653

: I I I I : I : : I I I III II : I I : I : I I I I I I 
Db 651 GTKITESTCLRTGPDLLMQQGITQLSKWDCLW ITLAWGL-FFRILFYLSLLFGSK 704



RESULT 12 
T08934 

hypothetical protein F27G19.20 - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 11-Jun-1999 #sequence_revision 11-Jun-1999 #text_change 09-Jul-2004
C;Accession: T08934 

R;Bevan, M.; Hilbert, H.; Braun, M.; Holzer, E.; Brandt, A.; Duesterhoeft, A.; Bancroft,

I.; Mewes, H.W.; Mayer, K.F.X.; Lemcke, K.; Schueller, C. 

submitted to the Protein Sequence Database, May 1999 

A;Reference number: Z16519 

A;Accession: T08934 

A;Molecule type: DNA 

A;Residues: 1-635 <BEV> 

A;Cross-references: UNIPROT:Q9SZR9; UNIPARC:UPI00000A1E91; EMBL:AL078467; GSPDB:GN00062; ATSP:F27G19.20

A;Experimental source: cultivar Columbia; BAC clone F27G19
C; Genetics : 

A;Gene: ATSP:F27G19.20
A;Map position: 4 

A;Introns: 38/3; 253/1; 304/1; 414/3 

C;Superfamily: fruit fly white protein; ATP-binding cassette homology

Query Match 21.5%; Score 721; DB 2; Length 635; 

Best Local Similarity 30.0%; Pred. No. 9.7e-43; 

Matches 203; Conservative 121; Mismatches 284; Indels 68; Gaps 18; 
Qy 1 MSSSNVEVFIPVSQGNTNGFPATASNDLKAFTEGAVLSFHNICYRVKLKSG FLPCRK 57
I : I : : II : : I I I I : I I I I I I I

Db 1 MDNQEVSMDVETPIAKTNDDRSLPFSIFKKANNPVTLKFENLVYTVKLKDSQGCFGKNDK 60

Qy 58 PVEKEILSNINGIMKPG-LNAILGPTGGGKSSLLDVLAAR--KDPSGLSGDVLINGAPRP 114

1:11 : I I : I I I : I : I I I : I I I : I I I I I : I : I : : I I 

Db 61 TEERTILKGLTGIVKPGEILAMLGPSGSGKTSLLTALGGRVGEGKGKLTGNISYNNKPLS 120

Qy 115 ANFKCNSGYVVQDDVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVA 174

I : I : I I I I : I I I I I I : I I I I : II:: I : I I I I I : 

Db 121 KAVKRTTGFVTQDDALYPNLTVTETLVFTALLRLPNSFKKQEKIKQAKAVMTELGLDRCK 180

Qy 175 DSKVGTQFIRGVSGGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQ 234
I : : I 1:1111111111 III I : : : I I : I I I I I I I : I I I I : I I : : : I : : : 

Db 181 DTIIGGPFLRGVSGGERKRVSIGQEILINPSLLFLDEPTSGLDSTTAQRIVSILWELARG 240

Qy 235 GRTIIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYN-NPADF 293

I I I : : : I I I I : | : : | I : I I I I I I I : I I 

Db 241 GRTVVTTIHQP SKGNPVYFGLGSNAMDYFASVGYSPLVERINPSDF 286

Qy 294 FLDIINGDSTAVALNREEDFKATEIIEPSKQDKPLIEKLAEIYVNSSFYK E 344

I I I I I I : : : I II: : I I I I 

Db 287 LLDIANGKPLLV ISCWPSVGSDESQRPEAMKAALV AFYKTNLLDSVINE 335

Qy 345 TKAELHQLSGGEKKKKITVFKEISYTTSFCHQLRWVSKRSFKNLLGNPQA--SIAQIIVT 402
I : : : : : : I : : I : I I I : : : I I I 

Db 336 VKGQDDLCNKPRESSRVATNTYGDWPTTWWQQFCVLLKRGLKQRRHDSFSGMKVAQIF-- 393

Qy 403 VVLGLVIGAIYFGLKNDSTGIQNRAGVLFFLTTNQCFSSV-SAVELFVVEKKLFIHEYIS 461
: : : I : : : I : : | : : | : | | | : : : | : : | | : : | | 

Db 394 -IVSFLCGLLWWQTK--ISRLQDQIGLLFFISSFWAFFPLFQQIFTFPQERAMLQKERSS 450

Qy 462 GYYRVSSYFLGKLLSDLLPMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSAS 521

I I I : I I I I : : : I I I I : : I I I : I I I III: I : : : 

Db 451 GMYRLSPYFLSRVVGD-LPMELILPTCFLVITYWMAGLNHNLANFFVTLLVLLVHVLVSG 509

Qy 522 SMALAIAAGQSVVSVATLLMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFTAL 581
: I I : I III:: : : : I I : : : I I : : I I I I : I 

Db 510 GLGLALGALVMDQKSATTLGSVIMLTFLLAGGYYVQHVPV--FISWIKYVSIGYYTYKLL 567

Qy 582 QHNEFLGQNFCPGLNATGNN PCNYATCTGEEYL-VKQGIDLSPWGLWKNHVALACMI 637

III I 1:1 I : I : : : I : : : | | | : 

Db 568 ILGQYTANELYPCGDNGKLRCHVGDFEGIKHIGFNSGL VSALALTAML 615

Qy 638 VIFLTIAYLKLLFLKK 653
I : : 111:1 : I 

Db 616 WYRVIAYIALTRIGK 631



RESULT 13 
G84791 

probable ABC transporter [imported] - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Feb-2001 #sequence_revision 02-Feb-2001 #text_change 09-Jul-2004 
C;Accession: G84791 

R;Lin, X.; Kaul, S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, C.D.; Fujii, C.Y.; 
Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V.; Buell, C.R.; Ketchum, K.A.; 
Lee, J.J.; Ronning, C.M.; Koo, H.; Moffat, K.S.; Cronin, L.A.; Shen, M.; VanAken, S.E.; 
Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams, M.D.; Carrera, A.J.; Creasy, T.H.; Goodman, H.
M.; Somerville, C.R.; Copenhaver, G.P.; Preuss, D.; Nierman, W.C.; White, O.; Eisen, J.A.;
Salzberg, S.L.; Fraser, C.M.; Venter, J.C.
Nature 402, 761-768, 1999 

A; Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. 

A;Reference number: A84420; MUID:20083487; PMID:10617197

A;Accession: G84791 

A; Status: preliminary 

A;Molecule type: DNA 

A;Residues: 1-755 <STO> 

A;Cross-references: UNIPROT:Q9ZUT0; UNIPARC:UPI00000A1BA9; GB:AE002093; NID:g4056489; PIDN:AAC98055.1; GSPDB:GN00139
C ; Genetics : 
A;Gene: At2g37360 
A;Map position: 2 

C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1; ATP-binding cassette homology

Query Match 21.5%; Score 720.5; DB 2; Length 755; 

Best Local Similarity 28.3%; Pred. No. 1.3e-42; 

Matches 202; Conservative 131; Mismatches 282; Indels 99; Gaps 21; 

Qy 2 SSSNVEVFIPVSQGNTNGFPATASNDLKAFTEGA-VLSFHNICYRVKLKSGFLP CRK 57

II : : I : : : I : I I : : | | | | : : | | | : : | | | | : 

Db 63 SSRALGIASPINSA-ASSENSWASAPASSISSSPEVLSETDLTYSVKIQKKENPLACCRR 121 



Qy 58 PVE KEILSNINGIMKPG-LNAILGPTGGGKSSLLDVLAARKDPSGLSGDVLING 110

I : I : I : I : I : I : I I : I I I I : I : I I I I | | : : | | 

Db 122 SGNDSSVNTKILLNGISGEAREGEMMAVLGASGSGKSTLIDALANRIAKDSLRGSITLNG 181



Qy 111 APRPANF-KCNSGYVVQDDVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELG 169

: : I 111:111:: I I I I I I I I II : : : : I I : : I : I I 
Db 182 EVLESSMQKVISAYVMQDDLLEPMLTVEETLMESAEERLPRSLSKKKKKARVQALIDQLG 241



Qy 170 LDKVADSKVGTQFIRGVSGGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLK 229

I I : : I : I I I I I I I I : I I I I :: I I I I I I I I I I I : I I I I :: I I : : I : 
Db 242 LRSAAKTVIGDEGHRGVSGGERRRVSIGNDIIHDPIILELDEPTSGLDSTSAYMVIKVLQ 301 



Qy 230 RMSKQGRTIIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNN 289

I : : : I : I I I I I I I I III I : I : : I : I : I 

Db 302 RIAQSGSIVIMSIHQPSYRIMGLLDQLIELSKGNTVYSGSPTHLPQEESEEKHPIPENEN 361 



Qy 290 PADFFLDIINGDSTAVALNREEDFKATEIIEPSKQDKPLIE KLAEIYVN 338

: I I I : I II:: : : I I I : I I I I I 

Db 362 KTEEALDLI RELEYS TEGTKPLVEFHKQWRAKQAPS YNNNNKR 404 



Qy 339 SSFYKETKAELHQ LSGGEKKKKITVFKEI-SYTTSFCHQLRWVSKRSFKNLLG 390

II: I : : : I I : : : I : : : I I : I 

Db 405 NTNVSSLKEAITASISRGKLVSGATNNNSSNLTPSFQTFANPFWIEMIVIGKRAILNSRR 464



Qy 391 NPQASIAQIIVTVVLGLVIGAIYFGLKNDSTGIQNRAGVLFFLTTNQCFSSVSAVELFVV 450
I : : : : I I : : : : : II I I I I I : : : I : : I : 

Db 465 QPELLGMRLGAVMVTGIILATMFTNLDNSPKGAQERLGFFAFAMSTTFYTCAEAIPVFLQ 524



Qy 451 EKKLFIHEYISGYYRVSSYFLGKLLSDLLPMTMLPSIIFTCIVYFMLGLKPKADAFFVMM 510
I : : I : I MINI:: : I : : I I : : : I I I : I I 

Db 525 ERYIFMRETAYNAYRRSSYVLSQSIIS-IPALIVLSASFAATTFWAVGLDGGANGFFFFY 583
Qy 511 FTLMMVAYSASSMALAIAAGQSVVSVATLLMTICFVFMMIFSGLLVNLTTIASWLSWLQY 570

II:: : : I I : : I : : : : : : I I I : : I : I I 

Db 584 FTILASFWAGSSFVTFLSGVIPNVMLGFTVWAILAYFLLFSGFFISRDRIPVYWLWFHY 643 

Qy 571 FSIPRYGFTALQHNEFLGQN FCPGLNATGNNP C 603

I : : I : : III II I I : I : I 

Db 644 ISLVKYPYEGVLQNEF--QNPTRCFARGVQLFDNSPLGEFPNDVKVNLLKSMSGVLGTNV 701

Qy 604 NYATC--TGEEYLVKQGI-DLSPWG-LWKNHVALACMIVIFLTIA YLKLLF 650

II I I : I : I I I I : I I II : I : I : : : I I 

Db 702 TAETCVTTGIDILKQQGITDISKWNCLW ITVAWGFFFRVLF 742 

RESULT 14 
T02567 

probable ATP-binding cassette protein T16B24.1 - Arabidopsis thaliana 

N; Alternate names: protein F12L6.1 

C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 10-Sep-1999 #sequence_revision 10-Sep-1999 #text_change 09-Jul-2004
C;Accession: T02567; T00545; C84816 

R;Rounsley, S.D.; Kaul, S.; Lin, X.; Ketchum, K.A.; Crosby, M.L.; Brandon, R.C.; Sykes, S 
M.; Mason, T.M.; Kerlavage, A.R.; Adams, M.D.; Somerville, C.R.; Venter, J.C. 
submitted to the EMBL Data Library, August 1998 

A; Description : Arabidopsis thaliana chromosome II BAC T16B24 genomic sequence. 
A;Reference number: Z14679 
A;Accession: T02567 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A;Residues: 1-740 <ROU> 

A;Cross-references: UNIPROT:O80946; UNIPARC:UPI00000AC233; EMBL:AC004697; NID:g3402671; PIDN:AAC28975.1; PID:g3402672

A; Experimental source: cultivar Columbia 

R;Rounsley, S.D.; Lin, X.; Ketchum, K.A.; Crosby, M.L.; Brandon, R.C.; Sykes, S.M.; Kaul, 
S.; Mason, T.M.; Kerlavage, A.R.; Adams, M.D.; Somerville, C.R.; Venter, J.C. 
submitted to the EMBL Data Library, July 1998 

A; Description : Arabidopsis thaliana chromosome II BAC F12L6 genomic sequence. 
A;Reference number: Z14168 
A;Accession: T00545 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A;Residues: 1-362 <ROW> 

A;Cross-references: UNIPARC:UPI00001746D9; EMBL:AC004218; NID:g3355463; PIDN:AAC27826.1; PID:g3355464

A; Experimental source: cultivar Columbia 

R;Lin, X.; Kaul, S.; Rounsley, S.D.; Shea, T.P.; Benito, M.I.; Town, C.D.; Fujii, C.Y.;
Mason, T.M.; Bowman, C.L.; Barnstead, M.E.; Feldblyum, T.V.; Buell, C.R.; Ketchum, K.A.;
Lee, J.J.; Ronning, C.M.; Koo, H.; Moffat, K.S.; Cronin, L.A.; Shen, M.; VanAken, S.E.;
Umayam, L.; Tallon, L.J.; Gill, J.E.; Adams, M.D.; Carrera, A.J.; Creasy, T.H.; Goodman,
M.; Somerville, C.R.; Copenhaver, G.P.; Preuss, D.; Nierman, W.C.; White, O.; Eisen, J.A.
Salzberg, S.L.; Fraser, C.M.; Venter, J.C.
Nature 402, 761-768, 1999 

A; Title: Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. 
A;Reference number: A84420; MUID:20083487; PMID:10617197
A;Accession: C84816 
A; Status: preliminary 
A;Molecule type: DNA 
A;Residues: 1-740 <STO>

A;Cross-references: UNIPARC:UPI00000AC233; GB:AE002093; NID:g3402672; PIDN:AAC28975.1; GSPDB:GN00139
C; Genetics : 

A;Gene: At2g39350; T16B24.1; F12L6.1 
A;Map position: 2 

C;Superfamily: Arabidopsis thaliana probable ATP-binding cassette protein F12L6.1; ATP-binding cassette homology
C; Keywords: ATP 

F; 110-310/Domain: ATP-binding cassette homology <ABC> 

Query Match 21.4%; Score 717.5; DB 1; Length 740; 

Best Local Similarity 28.0%; Pred. No. 2.1e-42; 

Matches 193; Conservative 120; Mismatches 294; Indels 83; Gaps 16; 

Qy 26 NDLKAFTEGAVLSFHNICYRVKLK SGFLPCR KPVEKEILSNING 69

II I 11111:11:: II : I I : I : I I : I 

Db 56 NDGYMRTVPFVLSFDNLTYNVSVRPKLDFRNLFPRRRTEDPEIAQTARPKTKTLLNNISG 115 

Qy 70 IMKPG-LNAILGPTGGGKSSLLDVLAARKDPSGLSGDVLINGAPRPAN-FKCNSGYVVQD 127

: I : I : I I : I I I I : I : I I I I I I I : I I : I I I I : I I 

Db 116 ETRDGEIMAVLGASGSGKSTLIDALANRIAKGSLKGTVKLNGETLQSRMLKVISAYVMQD 175 

Qy 128 DVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVADSKVGTQFIRGVS 187 

I : : 111111:1 II : : : I I : : I : I I : I : : I : I I : I 
Db 176 DLLFPMLTVEETLMFAAEFRLPRSLPKSKKKLRVQALIDQLGIRNAAKTIIGDEGHRGIS 235 

Qy 188 GGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSKQGRTIIFSIHQPRY 247 

1111:1 I I I : : : I II : I I I I I I I : I I I I :: I I : : I I I : : : I : I I I I I I : 

Db 236 GGERRRVSIGIDIIHDPILLFLDEPTSGLDSTSAFMVVKVLKRIAQSGSIVIMSIHQPSH 295 

Qy 248 SIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADFFLDII NG 300 

: I I I I : I : : I : I I I : I I I : I I 

Db 296 RVLGLLDRLIFLSRGHTVYSGSPASLPRFFTEFGSPIPENENRTEFALDLIRELEGSAGG 355 

Qy 301 DSTAVALNREEDFKATEIIEPSKQDKPLIEKLAEIYVNSSFYKETKAELHQ LSGGEK 357 

: I : I I : : I : II : II: : I : : : I I I I 

Db 356 TRGLIEFNK KWQEMKKQSNRQPPLTPP-SSPYPNLTLKEAIAASISRGKLVSGGES 410 

Qy 358 KKKITVFKEISYTTSFCHQLRWVSKRSFKNLLGNPQASIAQIIVTVVLGLVIGA 411

I : : : : : : I I I I I I : : I I : I : : 

Db 411 VAHGGATTNTTTLAVPAFANPMWIEIKTLSKRSMLNSRRQPELFGIRIASVVITGFILAT 470

Qy 412 IYFGLKNDSTGIQNRAGVLFFLTTNQCFSSVSAVELFVVEKKLFIHEYISGYYRVSSYFL 471

: : : I I I : I I I I : : : I : : I : I : : I : I I I I I I I 

Db 471 VFWRLDNSPKGVQERLGFFAFAMSTMFYTCADALPVFLQERYIFMRETAYNAYRRSSYVL 530 

Qy 472 GKLLSDLLPMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSMALAIAAGQ 531 

: 1:1:1 I : : I I : : : I I I : : 

Db 531 SHAIVS-FPSLIFLSVAFAATTYWAVGLDGGLTGLLFYCLIILASFWSGSSFVTFLSGW 589 

Qy 532 SVVSVATLLMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGFTALQHNEFLGQNF 591 

I : : : : : : I I I : I I : I I I : : I : I : I I I 

Db 590 PSVMLGYTIWAILAYFLLFSGFFINRNRIPDYWIWFHYMSLVKYPYEAVLQNEFSDATK 649 

Qy 592 C--PGLNATGNNP CNYATC--TGEEYLVKQG-IDLSP 623
Db 650 CFVRGVQIFDNTPLGELPEVMKLKLLGTVSKSLGVTISSTTCLTTGSDILRQQGWQLSK 709



Qy 624 WGLWKNHVALACMIVIFLTIA YLKLLF 650

I : I : I : I : : : I I 

Db 710 WN CLFITVAFGFFFRILF 727 



RESULT 15 
E96742 

probable ABC transporter F17M19.11 [imported] - Arabidopsis thaliana 
C; Species: Arabidopsis thaliana (mouse-ear cress) 

C;Date: 02-Mar-2001 #sequence_revision 02-Mar-2001 #text_change 09-Jul-2004 
C;Accession: E96742 

R;Theologis, A.; Ecker, J.R.; Palm, C.J.; Federspiel, N.A.; Kaul, S.; White, O.; Alonso,
J.; Altaf, H.; Araujo, R.; Bowman, C.L.; Brooks, S.Y.; Buehler, E.; Chan, A.; Chao, Q.;
Chen, H.; Cheuk, R.F.; Chin, C.W.; Chung, M.K.; Conn, L.; Conway, A.B.; Conway, A.R.; 
Creasy, T.H.; Dewar, K.; Dunn, P.; Etgu, P.; Feldblyum, T.V.; Feng, J.; Fong, B.; Fujii, C. 
Y.; Gill, J.E.; Goldsmith, A.D.; Haas, B.; Hansen, N.F.; Hughes, B.; Huizar, L. 
Nature 408, 816-820, 2000 

A;Authors: Hunter, J.L.; Jenkins, J.; Johnson-Hopson, C.; Khan, S.; Khaykin, E.; Kim, C.J.;
Koo, H.L.; Kremenetskaia, I.; Kurtz, D.B.; Kwan, A.; Lam, B.; Langin-Hooper, S.; Lee, A.;
Lee, J.M.; Lenz, C.A.; Li, J.H.; Li, Y.; Lin, X.; Liu, S.X.; Liu, Z.A.; Luros, J.S.; Maiti,
R.; Marziali, A.; Militscher, J.; Miranda, M.; Nguyen, M.; Nierman, W.C.; Osborne, B.I.;
Pai, G.; Peterson, J.; Pham, P.K.; Rizzo, M.; Rooney, T.; Rowley, D.; Sakano, H.
A;Authors: Salzberg, S.L.; Schwartz, J.R.; Shinn, P.; Southwick, A.M.; Sun, H.; Tallon, L.
J.; Tambunga, G.; Toriumi, M.J.; Town, C.D.; Utterback, T.; van Aken, S.; Vaysberg, M.;
Vysotskaia, V.S.; Walker, M.; Wu, D.; Yu, G.; Fraser, C.M.; Venter, J.C.; Davis, R.W.
A; Title: Sequence and analysis of chromosome 1 of the plant Arabidopsis. 
A;Reference number: A86141; MUID:21016719; PMID:11130712
A;Accession: E96742 
A; Status: preliminary 
A;Molecule type: DNA 
A;Residues: 1-609 <STO> 

A;Cross-references: UNIPROT:Q9C8W6; UNIPARC:UPI000009EF81; GB:AE005173; NID:g6978921; PIDN:AAF34313.1; GSPDB:GN00141
C; Genetics : 
A;Gene: F17M19.11 
A;Map position: 1 

C;Superfamily: fruit fly white protein; ATP-binding cassette homology

Query Match 21.3%; Score 712.5; DB 2; Length 609; 

Best Local Similarity 31.0%; Pred. No. 3.6e-42; 

Matches 190; Conservative 125; Mismatches 256; Indels 41; Gaps 16; 

Qy 56 RKPVEKEILSNINGIMKPG-LNAILGPTGGGKSSLLDVLAARKDPSGLSGDVLINGAPRP 114 

I I : III : I : : II I : I I I : I I I I : I I : : I I I I : I : I I I 
Db 23 RSTEERTILSGVTGMISPGEFMAVLGPSGSGKSTLLNAVAGRLHGSNLTGKILINDGKIT 82 



Qy 115 ANFKCNSGYVVQDDVVMGTLTVRENLQFSAALRLATTMTNHEKNERINRVIQELGLDKVA 174
: I : I I I I : : I I I I I I I I I I I : : I I I I I I I I I 

Db 83 KQTLKRTGFVAQDDLLYPHLTVRETLVFVALLRLPRSLTRDVKLRAAESVISELGLTKCE 142



Qy 175 DSKVGTQFIRGVSGGERKRTSIGMELITDPSILFLDEPTTGLDSSTANAVLLLLKRMSK- 233

: : I I 1111:111111111 I I : : I I : I I I I I I : I I I : : I : : I : : 
Db 143 NTWGNTFIRGISGGERKRVSIAHELLINPSLLVLDEPTSGLDATAALRLVQTLAGLAHG 202



Qy 234 QGRTIIFSIHQPRYSIFKLFDSLTLLASGRLMFHGPAQEALGYFESAGYHCEAYNNPADF 293 
: I : I : : I I I I I : I : : I I : : I I : I : : I I : : | : | | | | | : | | | | |

Db 203 KGKTVVTSIHQPSSRVFQMFDTVLLLSEGKCLFVGKGRDAMAYFESVGFSPAFPMNPADF 262

Qy 294 FLDIING--DSTAVALNREEDFKATEIIEPSKQDKPLIEKLAEI YVNSSFYKETKA 347

11:11 : I : : : I : I : : I : I : I I 
Db 263 LLDLANGVCQTDGVTEREKPNVRQTLVTAYDTLLAPQVKTCIEVSHFPQDNARFVKT 319

Qy 348 ELHQLSGGEKKKKITVFKEISYTTSFCHQL-RWVSKRSFKNLLGNPQASIAQIIVTVVLG 406
: : : I I I : : : | | | : : | : : : : | | 

Db 320 RVNGGGITTCIA TWFSQLCILLHRLLKERRHESF DLLRIFQVVAAS 365

Qy 407 LVIGAIYFGLKNDSTGIQNRAGVLFFLTTN-QCFSSVSAVELFVVEKKLFIHEYISGYYR 465
: : I : : : : I : : I I : I I I : : I : I I I I : : I I I I I 

Db 366 ILCGLMWW--HSDYRDVHDRLGLLFFISIFWGVLPSFNAVFTFPQERAIFTRERASGMYT 423

Qy 466 VSSYFLGKLLSDLLPMTMLPSIIFTCIVYFMLGLKPKADAFFVMMFTLMMVAYSASSMAL 525
: I I I I : : I I : I I : I I : I : I : I I : : I : : : : : I 

Db 424 LSSYFMAHVLGSLSMELVLPASFLT-FTYWMVYLRPGIVPFLLTLSVLLLYVLASQGLGL 482

Qy 526 AIAAGQSVVSVATLLMTICFVFMMIFSGLLVNLTTIASWLSWLQYFSIPRYGF TALQ 582

1:1 I : : : I : : : : I II : I : I : : I I I : I : I 

Db 483 ALGAAIMDAKKASTIVTVTMLAFVLTGGYYVN--KVPSGMVWMKYVSTTFYCYRLLVAIQ 540

Qy 583 HNEFLGQNFCP--GLNATGNNPCNYATCTGEEYLVKQGIDLSPWGLWKNHVALACMIVIF 640
: I : I : : I : | | | : : : : | | : | : | | : 

Db 541 YGS--GEEILRMLGCDSKGKQGASAATSAGCRFVEEEVI--GDVGMWTSVGVLFLMFFGY 596

Qy 641 LTIAYLKLLFLK 652

: I I I I : I 
Db 597 RVLAYLALRRIK 608



Search completed: September 18, 2008, 22:09:13 
Job time : 43 secs